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Cognitive neuropsychology has championed the use of single-case research design. Recently, 
however, case series designs that employ multiple single cases have been increasingly utilized to 
address theoretical issues using data from neuropsychological populations. In this paper, we 
examine these methodologies, focusing on a number of points in particular. First we discuss the 
use of dissociations and associations, often thought of as a defining feature of cognitive neuropsy- 
chology, and argue that they are better viewed as part of a spectrum of methods that aim to 
explain and predict behaviour. We also raise issues regarding case series design in particular, 
arguing that selection of an appropriate sample, including controlling degree of homogeneity, is criti- 
cal and constrains the theoretical claims that can be made on the basis of the data. We discuss the 
possible interpretation of "outliers" in a case series, suggesting that while they may reflect "noise" 
caused by variability in performance due to factors that are not of relevance to the theoretical 
claims, they may also reflect the presence of patterns that are critical to test, refine, and potentially 
falsify our theories. The role of case series in treatment research is also raised, in light of the fact that, 
despite their status as gold standard, randomized controlled trials cannot provide answers to many 
crucial theoretical and clinical questions. Finally, we stress the importance of converging evidence: 
We propose that it is conclusions informed by multiple sources of evidence that are likely to best 
inform theory and stand the test of time. 
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In the development of theories of cognitive 
processes, cognitive neuropsychology has been 
unusual within cognitive psychology in promoting 



the use of single-case studies (e.g., Caramazza, 
1986). In addition, many researchers within cogni- 
tive neuropsychology have vehemently argued 
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against the use of group studies where data are 

averaged across cases (e.g., Caramazza 8c 
McCloskey, 1988; Sokol, McCloskey, Cohen, & 
Aliminosa, 1991). Nevertheless, it has always 
been the case that research articles would often 
include more than a single case (e.g., 
Warrington, 1975). These cases are examined for 
similarities (associations) and differences (dis- 
sociations) in the patterns of performance that 
are used to provide further evidence in the devel- 
opment of the theory. More recently, the practice 
of including multiple cases has become more 
common, and data are often presented from mul- 
tiple single cases (e.g., Goldrick, Folk, 8c Rapp, 
2010). These case series investigations also share 
the aim of informing cognitive theories, enabling 
us to explain and predict the behaviour of the 
unimpaired and impaired cognitive system. 

Traditionally, theoretical accounts in cognitive 
neuropsychology have been on the basis that a 
general cognitive theory is being tested with 
data from brain-damaged patients on the 
assumption that the theory constrains the poss- 
ible forms of impairment. At a more sophisti- 
cated level, a theory might require certain 
relationships between symptoms. So, for instance, 
the Dell and O'Seaghdha (1991) model predicted 
only a certain set of possible rates of particular 
error types; any error rates outside this set 
would provide evidence against the adequacy of 
the model. Hence, any person with impaired 
word retrieval who produces an error pattern 
outside these limits is a challenge to, and possible 
disproof of, the model. This kind of evidence 
requires just a single patient to challenge the ade- 
quacy of the theory. Nickels and Howard (1995b) 
explored lesions to Dell and O'Seaghdha's (1991) 
model and pointed out that there should be a 
positive correlation between the proportion of 
semantic errors and the proportion of phonologi- 
cal errors that are real words. This prediction of 
the model cannot be tested with a single case. 
Nickels and Howard's (1995b) claim relates to 
the relationship between different kinds of 
errors that should be observed across a case 
series. Indeed, Nickels and Howard found that 
the relationship did not hold. 



Hence, the point we wish to make at the 
outset of this discussion is that a theoretical 
account can be tested both with data from 
single subjects and from case series; neither has 
privileged value, but both are important. We 
therefore wholeheartedly agree with Schwartz 
and Dell (2010) when they argue that case 
series methodology provides a useful complement 
to single-case studies in cognitive neuropsycholo- 
gical and cognitive neuroscience research: Single- 
case and case series methodologies each bring 
unique strengths to the field. We have been pub- 
lishing case series of people with aphasia for more 
than 25 years, in relation to both their patterns of 
performance and their response to intervention 
(e.g., Best, Herbert, Hickin, Osborne, 8c 
Howard, 2002; Best, Hickin, Herbert, Howard, 
8c Osborne, 2000; Howard, Patterson, Franklin, 
Morton, 8c Orchard-Lisle, 1984; Nickels 8c 
Howard, 1994, 1995a, 1995b, 1996). Like 
Schwartz and Dell, we too have argued for the 
relevance of the importance of case series (e.g., 
Howard, 2003). However, we differ from 
Schwartz and Dell in our view of what constitu- 
tes a case series. We believe that a case series is 
indeed a concatenation of multiple single-case 
studies with constraints noted by Schwartz and 
Dell: For this concatenation to be a "true" case 
series, it should include data on the same tasks 
and materials. It is also the case that (by neces- 
sity) the cognitive analysis may often be more 
restricted than for single-case reports. Schwartz 
and Dell argue that case series design primarily 
aims to examine covariance across measures of 
cognitive performance and understand the cogni- 
tive mechanisms underlying this covariance. 
While this is a potential aim, we feel that the 
term "case series design" can be applied more 
widely, with case series reports of this type repre- 
senting a subset of the larger set of case series 
used to inform theoretical debate. 

Dissociations and associations 

Both associations and dissociations in performance 
are interesting and important in contributing to 
theory development: Here, we agree, once again. 
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with Schwartz and Dell (2010). Critically, both 
dissociations and associations can be examined 
using single-case and case series methodology. 
Nevertheless, in the literature there has been a ten- 
dency for single-case studies to be linked to the 
methodology of dissociations and case series to 
associations (e.g., Patterson & Plaut, 2009). We 
do not dispute that cognitive neuropsychology 
may have paid less attention to associations than 
dissociations in the past. Certainly, descriptions 
of cognitive neuropsychological logic have often 
focused on the importance of the "double dis- 
sociation" in single-case studies (e.g., Coltheart, 
2002; Shallice, 1988). While clearly a powerflil 
methodology, in fact single cases have also been 
used to inform theoretical models using methods 
other than the double dissociation. Schwartz and 
Dell provide the example of Warrington (1975) 
who provided associations between performance 
on words and pictures and a dissociation with 
other nonsemantic language tasks to demonstrate 
the profile of semantic memory impairment. 
Hence, a cornerstone of the single-case approach 
is the use of converging evidence for associations 
and dissociations across tasks. However, critically, 
the single-case approach goes beyond measures of 
accuracy to examine error types and qualitative 
differences between individuals. For example, it 
is not merely that M.F. (Best, 1996) is poor at 
picture naming that is of interest. It is also the 
fact that he produces formal paraphasias (errors 
that are real words that share phonemes with the 
target, e.g., a CART named as "card") that is 
used as support for models of language production 
in which there is feedback from phonological to 
lexical representations (Goldrick, 2006). 
Similarly it is not that M.K. (Howard & 
Franklin, 1988) makes errors in spoken picture 
naming that is critical for informing theoretical 
accounts, but that he produces "orthographic" 
errors. For example, when naming a picture of an 
irregularly spelled word, such as a picture of a 
BEAR, M.K. may produce "beer" — a regularized 
pronunciation of the target: It is the occurrence 
of these errors that led Howard and Franklin to 
propose a "graphemic-to-orthographic" conver- 
sion route that converts graphemes (activated via 



the orthographic output lexicon) to input ortho- 
graphic codes (which can then be read aloud 
using the sublexical grapheme -phoneme conver- 
sion route). These are data that depend on a 
single (unusual) case, but, we want to argue, they 
are relevant for the understanding of the cognitive 
architecture of language processing in people 
without language impairment. That depends on 
appreciating that single cases are important and 
should not be discounted (as "outliers" or "excep- 
tions"); those that use correlational methods 
must believe in the reliability of the differences 
between their subjects and, therefore, cannot 
lightly dismiss data from single subjects as excep- 
tions. Correlational analyses necessarily require 
(some degree of) confidence in the differences in 
the levels of performance of participants in the 
constituent tasks — people interested in associ- 
ations and those interested in dissociations will 
both, necessarily, be committed to believing that 
their data reveal real differences between partici- 
pants (although subject to statistical noise). The 
consequence is that case series can yield data 
about both associations and dissociations. 

This is not to say that every apparent dissociat- 
ing case is a true dissociation. There are situations 
where it is possible that an apparent dissociation 
results from, for example, the use of an alternative 
strategy. Blind acceptance of dissociating cases is 
no better than blind rejection. Data on these 
cases must be robust and replicable to exclude 
the possibility of alternative strategies or noisy 
data (e.g., from "bad days"). Nevertheless, we 
would argue that even a single "true" dissociating 
case is important when evaluating associations. 
We illustrate this using the example cited by 
Schwartz and Dell (2010) of the phonological 
deficit hypothesis for phonological dyslexia. 

Patterson and Plaut (2009) report an association 
in patients with acquired phonological dyslexia, 
who are poor at reading nonwords. Such individ- 
uals are poor at producing and/or judging infor- 
mation about the sounds of written letter strings, 
and the typical association is with deficits in phono- 
logical tasks that do not involve written-word 
stimuli (e.g., segmentation and blending tasks). 
Patterson and Plaut (2009) therefore argued that 
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a phonological impairment was the cause of phono- 
logical dyslexia. In a test of this hypothesis, Crisp 
and Lambon Ralph (2006) found a correlation 
between nonword reading accuracy and nonreading 
phonological skills in a group of individuals with 
nonword reading impairments. Their conclusion 
is that "phonological"^ impairments (in the 
absence of semantic impairments) result in phono- 
logical dyslexia, with specifically, the extent of the 
nonword reading impairment being linked to the 
extent of the "phonological" impairment. 

Howard and Nickels (2005) report two cases, 
M.M.G. and H.B., both of whom show impair- 
ments in nonword repetition, digit span, and phono- 
logical manipulation tasks (e.g., segmenting the first 
sound from a spoken nonword). Thus these two 
individuals have the defining features of phonologi- 
cal impairment described by Crisp and Lambon 
Ralph (2006). The prediction from the phonological 
deficit account of acquired phonological dyslexia is 
that H.B. and M.M.G. should be phonological dys- 
lexics with impaired nonword reading. M.M.G. 
does indeed show this pattern, but H.B. shows 
nonword reading performance that is within 
normal limits for accuracy and latency. This is 
despite the fact that his ability to perform phonolo- 
gical tasks is as severely impaired as that of M.M.G. 
and many other phonological dyslexics (e.g., 
Patterson ScMarcel, 1992). Hence, H.B.'s dissociat- 
ing pattern provides clear evidence against the 
hypothesis that acquired "phonological" impair- 
ments necessarily cause nonword reading impair- 
ments.^ Indeed, Howard and Nickels (2005) 
suggest that the association could have a causal 
relationship in the reverse direction. Unimpaired 
subjects use orthography to support their perform- 
ance in tasks requiring "phonological" manipulation, 
while those participants with phonological dyslexia 
and poor nonword reading wUl have a reduced 



support. Hence impaired orthographic skills may 
result in impairment on phonological manipulation 
task: the reverse pattern to that claimed by the pho- 
nological deficit hypothesis. In summary, it remains 
the case that a single dissociating case can challenge a 
hypothesis made on the basis of an association and 
should not be cast aside as an exception. 

A parallel situation arises in the anatomical case 
series that are discussed by Schwartz and Dell 
(2010). They describe data from Schwartz et al. 
(2009) who found brain regions associated with 
semantic errors in naming when controlling for 
comprehension: the left anterior temporal lobe 
and particularly the left mid to anterior middle tem- 
poral gyrus. They claim that damage to this area is 
associated with postsemantic, prephonological 
generation of semantic errors. However, once 
again, if there is just one individual with postse- 
mantic errors who does not have a lesion in the 
anterior temporal lobe (or wherever may be 
claimed) then such a lesion is not necessary for post- 
semantic errors. And, conversely, if there is one 
individual without postsemantic errors who has a 
lesion in this area then it is not sufficient for these 
errors. That is, just as in behavioural studies con- 
cerned with cognitive theories, anatomical 
studies, concerned with localization of cognitive 
processes, also require converging evidence from 
both associations and dissociations, and the latter 
can call into question conclusions from the former. 

Sampling and heterogeneity in case series 

A further important issue in case series investigations 

is sampling. This is particularly pertinent when 
associations are the tool used to evaluate theoretical 
accounts. To illustrate our point, we return to the 
work of Crisp and Lambon Ralph (2006) who 
chose individuals with symptoms of phonological 



^ We refer to "phonological" in quotation marks, in order to indicate that this refers to phonology as indexed by the abihty to 
perform tasks involving phonological manipulation (e.g., segmentation, blending) and not necessarily to, for example, a phonological 
impairment that results in phonological errors in word production. 

^ We acknowledge the situation may be different in development where poor phonological skills are a strong predictor of dyslexia 
in longitudinal studies (Muter, Hulme, SnowUng, 8c Stevenson, 2004). In this field, there are several reports of cases with normal 
phonological processing and poor reading (often developmental surface dyslexia, e.g.. Castles 8c Coltheart, 1996) and normal readers 
of nonalphabetic languages with poor phonological manipulation skills (e.g.. Read, Yun-Fei, Hong-Yin, 8c Bao-Qing, 1986). 
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or deep dyslexia (defined by a lexicality effect, image- 
ability effect, or semantic errors in reading aloud). 
Given that their primary aim was to examine the 
characteristics of phonological/deep dyslexia, this 
was an entirely appropriate sample. However, as dis- 
cussed above. Crisp and Lambon Ralph also used the 
data to argue that the cause of the reading impair- 
ment is the underlying phonological (and semantic) 
impairment. They state, for example, "The data from 
this study suggest that poor nonword reading is 
directly related to the underlying phonological 
impairment," (p. 359) and "Assessments of phonolo- 
gical and semantic impairments suggested that the 
integrity of these primary systems underpinned the 
patients' reading performance" (p. 348). However, 
in order to make these claims, a rather different 
sample is required: a sample of individuals who are 
chosen because they are impaired in semantics 
and/or phonology regardless of the characteristics 
of their reading. Such a sample may show different 
patterns to that of a sample of individuals with pho- 
nological/deep dyslexia. Moreover, such a sample 
could include individuals, like H.B. (Howard & 
Nickels, 2005) described above, whose pattern of 
performance calls into question the hypothesis 
being addressed. 

It is clear that choosing an appropriate sample is 
not as straightforward as it might at first appear. 
Great care must be taken in ensuring appropriate 
sampling for case series, and the conclusions drawn 
on the basis of case series data should be limited to 
those that the sample can truly support. 
Researchers need to define their theoretical question 
and determine the characteristics of the sample 
needed to investigate this question. The importance 
of sampling brings us to a related issue, that of het- 
erogeneity in case series and its control. 

Heterogeneity 

Schwartz and Dell (2010) suggest that one of the 

important differences between group designs and 
case series designs is in the treatment of hetero- 
geneity. Heterogeneity is avoided in group 



studies where it is viewed as a problem: The 
mean of a group is taken to be a good estimate 
of the mean of that population from which that 
group is drawn; any deviations from the mean 
for the individual subjects are taken to be measure- 
ment error or individual differences without inter- 
est (just variation about the mean). In contrast, in 
case series, Schwartz and Dell argue that hetero- 
geneity is vital, and that it is through exploration 
of the causes of heterogeneity that case series can 
advance theory. However, it is important to con- 
sider exactly what we mean here by heterogeneity 
and what the underlying assumptions might be. 
In particular it is important to draw the distinction 
between heterogeneity in the sense of variability in 
the dependent variable and heterogeneity meaning 
variability in other aspects of performance such as 
level of impairment. 

We agree that variability in the dependent vari- 
able is essential: In order to perform meaningful 
correlations, it is important that there be variability 
in the factor of interest. For example, if every indi- 
vidual had a naming accuracy of 30%, it would be 
impossible to investigate which factors affected 
the success of word retrieval across the case 
series.'' Nevertheless, other forms of heterogeneity 
in the dependent variable may not be as desirable. 
For example, naming accuracy may be affected by 
attentional factors or visual processing impair- 
ments. This would result in variability in the depen- 
dent variable, but for reasons that would not inform 
our understanding of word retrieval from the 
mental lexicon. Therefore, there needs to be both 
variability in performance and homogeneity in the 
source of the variability in the dependent variable: 
To achieve this homogeneity, participants need to 
be excluded who have impairments that influence 
the dependent variable but are not related to the 
cognitive process under consideration. 

In a similar way, there needs to be homogeneity 
in other characteristics of the sample. The issue is 
how homogeneous does the sample need to be 
and on which measures? There is no easy answer 



^ However, it would still be possible to examine which factors affected success of naming items within each individual and 
compare this across the case series. 
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to this question, as which factors should be con- 
trolled will vary with the theory and research ques- 
tions, but it is critical that the inclusion and 
exclusion criteria for a sample be both clear and 
theoretically motivated. For example, if the aim of 
an investigation is to examine the factors affecting 
the production of semantic errors in word 
production, what exclusion criteria might be appro- 
priate? In a strictly feed-forward theory of word 
production, phonological processing will have no 
impact on the likelihood of semantic error pro- 
duction. Hence, it may be thought that individuals 
who produce semantic errors but also have phono- 
logical impairments could be sensibly excluded. 
However, in a highly interactive theory where acti- 
vation at later stages of processing feeds back to 
earlier stages of processing, it is possible that 
impairments in phonological processing could 
have an impact on semantic error production. 
Therefore, in order to discriminate between these 
theories, both sets of individuals need to be 
included, and analyses need to be carried out to 
compare them. However, even if the focus is on 
testing a feed-forward theory alone (perhaps in 
the absence of an alternative theory), there is an 
argument for including individuals who produce 
semantic errors and have phonological impair- 
ments: It is possible that phonological processes 
will be found to have an impact on the likelihood 
of semantic error production, which, in turn, 
would refiite the feed-forward theory. 



Summary: Sampling and heterogeneity in case series 
in vestigations 

Sampling is a complex issue in case series and one 
that perhaps has not received the attention it 
deserves in the literature. The participants selected 
will constrain the theoretical claims that can be 
made; particular care must be taken to ensure 
claims do not go beyond those that are warranted 
by the sample. 

When selecting participants for case series 
investigation, it is important that: 

• The case series should have a clear theoretical 
motivation. 



• The hypotheses to be tested are used to con- 
strain the suitable participants (and in turn the 
theoretical claims that are able to be supported 
by the data). 

• There is no evidence of input impairments at a 
prior level of processing that would affect critical 
measures (e.g., unrelated visual input impair- 
ments when investigating reading). 

• There is no evidence of output impairments 
below the level of processing being investigated 
that would affect critical measures (e.g., motor 
speech impairments when investigating word 
retrieval). 

• Performance is above floor and below ceiling to 
enable sensitivity to changes in the variable 
under consideration. 

• When case series are used to test a causal 
relationship, sampling includes individuals on 
the basis of each associated impairment 
independently. 

Heterogeneity and the treatment of outliers 

In their discussion of heterogeneity, Schwartz and 
Dell (2010) note that when case series analysis is 
used to test a model, each case can be characterized 
as consistent with the model or not. In other 
words, the data can be examined and outliers 
detected. This raises the question of how one is 
to determine what is an outlier. 

It is important that statistics be brought to bear 
in the examination of whether individuals in a case 
series are showing equivalent effects. For example. 
Best et al. (2002) examined the effects of phonolo- 
gical and orthographic cues on word retrieval. A 
homogeneity test (Leach, 1979) was used to 
demonstrate that there were differences across 
the individuals in the case series in the effective- 
ness of cues on word retrieval. In contrast, using 
the same test, Howard, Hickin, Redmond, 
Clark, and Best (2006) demonstrated that there 
was no significant difference in the effects of 
word-picture matching on subsequent naming 
across the individuals in their case series. This 
was despite performance at posttest (where 
pretest was zero) varying from 13% to 88% 
correct. Similarly, Best (2005) found no significant 
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difference in the therapy effect for children 
included in a case series intervention study, 
suggesting that, despite the children's different 
language and learning profiles, the intervention 
was working in the same way. 

Assuming that outliers are demonstrated to 
differ statistically from the rest of the cases, then 
they may be of two types: An outlier may be a real 
example of a dissociating pattern that can falsify a 
theory. For example. Castles, Bates, and 
Coltheart (2006) examined reading performance 
of a large sample of children unselected for 
reading ability (2,066 developing readers and 
1,026 older readers). Most children showed 
similar performance on reading of irregular words 
and nonwords as would be expected given the 
high correlation found between these two abilities. 
However, a small proportion (10-11%) showed a 
pattern of developmental surface dyslexia, where 
nonword reading was within normal limits (less 
than half a standard deviation below the mean), 
and irregular word reading was severely impaired 
(more than 1.64 standard deviations below the 
mean). In addition, there were outliers in the 
other direction, developmental phonological dys- 
lexics, where nonword reading was poor, and irre- 
gular word reading was unimpaired. Castles et al. 
then used these outliers as evidence against a con- 
nectionist model of reading acquisition (Harm & 
Seidenberg, 1999), which is unable to simulate 
pure cases of surface and phonological dyslexia. 

An alternative to "real" outliers occurs when out- 
liers are due to an additional confounding factor. 
Take, for example, a hypothetical study that exam- 
ined the relationship between word retrieval and 
semantic processing as measured by picture naming 
and spoken word-picture matching respectively. If 
an individual had an auditory processing impair- 
ment, this would "artificially" lower the individual's 
score on word-picture matching such that it no 
longer represented a "true" measure of their semantic 
processing. Consequently, this individual may 
appear as an outher in terms of the general trend 
for poorer picture naming to be correlated with 
poorer word-picture matching. Schwartz and Dell 
(2010) propose that explanations for outliers can be 
sought through additional assessment. We agree 



that the explanation could be found by additional 

assessments but disagree strongly that such assess- 
ments should be reserved for those who deviate 
from the trends. Such a fishing expedition runs the 
risk of providing a post hoc explanation that may 
prove to be false when all tasks are carried out by 
all participants. For example, as a result of additional 
testing, one may claim that participant X doesn't 
accord with our hypothesis because she or he has 
characteristic Y. However, it may turn out that one 
or some or many of the participants who accord 
with the hypothesis also show characteristic 
Y. Hence, any additional testing must be performed 
with all of the individuals in the case series in order 
for such a claim to be made. We would advocate 
that it is vital that case series analyses be combined 
with fiiU cognitive characterization. 

Case series in investigations of the effects of 
treatment 

One important area that was not covered by 
Schwartz and Dell (2010) in their consideration of 
case series as a methodology was the use of case 
series in intervention studies. Despite the fact that 
randomized controlled trials (RCTs) are the gold 
standard for treatment efficacy, it remains a fact 
that RCTs, just like any group study, involve aver- 
aging across participants, and hence a positive 
result means that "on average" a treatment is effec- 
tive. It is often the case that the treatment is not 
effective for every individual, or that the overall 
size of the effect is very small (but significant 
through the size of the group). The problems with 
RCTs in the field of aphasia have long been 
acknowledged (e.g., Hegde, 2007; Howard, 1986), 
and single-cases/case series designs (e.g., Perdices 
&Tate, 2009; Tate et al., 2008; Thompson, 2006) 
are widely used to evaluate the efficacy of treatment 
at the level of the individual. More recently, these 
designs (labelled as n of 1 randomized controlled 
trials) have begun to become more widespread 
even in medicine (e.g., Guyatt et al., 1990). 
However, once again, there are limitations to what 
the single-case study design can tell us regarding 
treatment, and replication across a series of single 
cases is recognized as important. It is only through 
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case series that we can determine which factors are 
important in the efficacy of treatment, by, once 
again, searching for associations and dissociations 
between the dependent variable (here response to 
treatment) and other measures (e.g., those reflecting 
the nature of the impairment). For example, as 
described above, Best et al. (2002) examined the 
effects of phonological and orthographic cues as 
facilitators of word retrieval across a case series of 
people with aphasia. They analysed the effects of 
cueing for each participant individually and then 
used the data from all the individuals to investigate 
the mechanisms underlying the effectiveness of the 
cues. Through a series of correlations, they deter- 
mined that severity of the naming impairment did 
not relate to effectiveness, but that success of ortho- 
graphic cueing was predicted by accuracy of 
nonword reading (specifically the accuracy of the 
first letter or phoneme). In contrast, success of pho- 
nemic cues did not correlate with accuracy of 
nonword repetition. On the basis of these data. 
Best et al. were able to draw conclusions regarding 
the mechanisms by which orthographic and phono- 
logical cues were operating and constrain theories of 
language production. 

Hence, both clinically and theoretically, treat- 
ment studies can benefit from case series method- 
ology with the same constraints as those outlined 
above for studies not involving treatment (see for 
further discussion, Howard, 2000, 2003). 

Converging evidence from diflferent designs 

As we noted above, Schwartz and DeU (2010) 
argue that case series are important as an alterna- 
tive and complement to single-case study design. 
We would like to go further and argue that 
single cases and case series both play an essential 
part of theory building. We would argue against 
a slow drift away from the depth and rigour that 
a well designed and executed single-case study 
can bring to theoretical debate: In case series, 
inevitably, the reported data are less complete. 
The advantage of good single-case studies is that 
multiple converging sources of evidence from 
different experimental paradigms can be brought 
to bear on a single theoretical issue. While, in 



principle, this is possible in case series, it does 
not happen in practice (for good practical reasons 
that include time, participant compliance, etc.). 

It is our view that consideration of data from mul- 
tiple experimental designs (including case series and 
single cases; treatment and nontreatment) and from 
multiple sources (both intact and impaired popu- 
lations; developmental and acquired disorders; treat- 
ment studies and nontreatment) is most likely to 
inform theories in such a way that they will stand 
the test of time. A recent example is the burgeoning 
interest in phonological neighbours as predicting 
the ease of lexical retrieval. Neighbourhood density 
has been shown to be important in typically develop- 
ing children (Storkel & Lee, 2011), in skilled adult 
readers (Grainger, Muneaux, Farioli, & Ziegler, 
2005), and in children with language impairments 
(German Sc Newman, 2004). These are aU group 
studies. Our primary interest is in the value that can 
be added by detailed investigation with single-case 
and case series studies. Here we find better perform- 
ance in aphasic naming for items with more phonolo- 
gical neighbours in a single case (e.g.. Best, 1995) and 
that several aspects of overlap including initial 
segment contribute independently to the production 
of (nonsemantic) word errors in a case series 
(Goldrick et al., 2010). These findings with strong 
implications for models of typical and impaired 
language production result from carefiil investigation 
of qualitative as well as quantitative aspects of per- 
formance. Similarly, Graham and Hodges (1997) 
provide a rare example of the use of converging evi- 
dence from different designs — ^both group study 
and single-case investigation to inform our under- 
standing of how different brain areas are employed 
in retaining memories over time. Just as in the 
example focusing on neighbourhood size, this 
demonstrates the benefits of combining across 
research designs. 

In addition, it is important that evidence be 
combined across populations. All too frequently, 
researchers focus on one population: unimpaired 
adults, typically developing children, language- 
impaired adults, or language-impaired children. 
The research in each of these fields is often 
guilty of appearing unaware (or even being 
unaware) of that carried out in the other fields. 
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Indeed, even within research involving the same 
populations this can happen. For example, 
research on literacy development and literacy dis- 
orders deriving from psychology and from edu- 
cation take very different perspectives that could 
benefit from dialogue. Similarly, even within cog- 
nitive psychology, research on short-term memory 
is rarely combined with research on language per- 
ception and production (but see Jacquemot & 
Scott, 2006, for an exception). 

In sum, converging evidence has always been a 
cornerstone of cognitive neuropsychology in order 
to determine the level of impairment. However, 
the use of converging evidence should be broader, 
converging across designs and populations. The 
volume edited by Brenda Rapp (2001) ^^The 
Handbook of Cognitive Neuropsychology" is an excel- 
lent example of this approach. Each of the chapters 
explicitly brings together relevant evidence not just 
from cognitive impairments (usually single cases), 
but also from cognitive psychology (usually 
groups), computational modelling, and neuro- 
science, neatly demonstrating how these sources 
of evidence combine to produce a whole greater 
than the sum of the contributing parts. 

Conclusion 

In conclusion, we agree with many but not aU of 
the points about case series raised by Schwartz 
and Dell (2010), and we welcome the opportunity 
to explore these issues. As Hegde (2007) notes, 
"researchers select experimental designs based on 
their training, experience, expertise, and research 
philosophy. And it will continue to be that way. 
Those who typically use a particular strategy will 
retain a healthy critical disposition toward the 
one they do not use. This is good for the 
science . . . because the skeptics of any approach 
will help keep the enthusiasts a notch below extre- 
mists" (p. 30). 

We have discussed issues relating to dissociations 
and associations, which we believe are too often 



thought of as the primary tools of cognitive neuropsy- 
chology. In fact, the identification of associations and 
dissociations are simply two, of a number, of methods 
that aim to explain and predict behaviour. We have 
argued that, for case series, selection of an appropriate 
sample, including controlling type of heterogeneity, is 
critical and constrains the theoretical claims that can 
be made on the basis of the data. We note that oudiers 
in a case series may indeed be "noise" caused by varia- 
bility in performance due to factors that are not of rel- 
evance to the theoretical claims. Alternatively, 
however, these outliers may be those critical, rare 
cases that cognitive neuropsychology has long 
embraced in the testing and development of cognitive 
theories. It is vital that, as case series become more fre- 
quent, these "true" outliers retain the status they 
deserve. We have also discussed the role of case 
series in treatment research and note that, despite 
their status as gold standard, randomized controlled 
trials cannot provide answers to many crucial theor- 
etical and clinical questions. Indeed, Kaptchuk 
(2001) makes the observation "while a gold standard 
is valuable, any worshiping at an altar of a golden calf, 
like the Biblical Exodus story, may obscure 
'reality'. . . . Unless one is aware of a research metho- 
dology's weaknesses, scientific activity may become a 
mechanical ritual" (p. 548). 

The point we should like to focus on is the impor- 
tance of converging evidence: We propose that con- 
clusions based on clearly articulated theory and 
informed by multiple sources of evidence are likely 
to best inform theory and stand the test of time. 
We have also argued that one cannot afford to 
dismiss findings that conflict with a theoretical pos- 
ition as inconvenient or misleading data. The title for 
Patterson and Plaut's (2009) paper was a quotation 
from Alexander Pope's (1711) poem "A Little 
Learning". We adapt the first two lines of this 
poem (with our addition in brackets): 

A little learning is a dang'rous thing, 

drink deep [and firom several sources], or taste not the Pierian 
spring 
(p. 14). 



* In Greek mythology, the Pierian Spring of Macedonia was sacred to the Muses, as the metaphorical source of knowledge of art 
and science. 
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